Role of Categorical Variables in Multicollinearity in the Linear Regression Model
Abstract
This article discusses the role of categorical variables in the problem of multicollinearity in the linear regression model. It applies the condition number, a standard diagnostic tool, to linear regression models with categorical explanatory variables and analyzes how the dummy variables and the choice of reference category can affect the degree of multicollinearity. This effect is studied analytically as well as numerically, through simulation and a real data application.
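The dependence of the condition number on the reference category can be illustrated with a small numerical sketch. The code below is not the article's procedure: it assumes a hypothetical three-level factor with group sizes 5, 45 and 50, builds an intercept-plus-dummies design matrix for each possible reference level, and computes the condition number after scaling every column to unit length (a common convention, assumed here rather than taken from the article). The helper names `design_matrix` and `scaled_condition_number` are illustrative only.

```python
import numpy as np

def scaled_condition_number(X):
    """Condition number of X after scaling each column to unit Euclidean length.

    Column scaling is an assumption of this sketch, not a detail from the article.
    """
    Xs = X / np.linalg.norm(X, axis=0)
    s = np.linalg.svd(Xs, compute_uv=False)  # singular values
    return s.max() / s.min()

def design_matrix(groups, reference, n_levels):
    """Intercept plus one 0/1 dummy column for every non-reference level."""
    cols = [np.ones(len(groups))]
    for level in range(n_levels):
        if level != reference:
            cols.append((groups == level).astype(float))
    return np.column_stack(cols)

# Hypothetical data: a three-level factor in which level 0 is rare.
counts = [5, 45, 50]
groups = np.repeat(np.arange(3), counts)

# Condition number of the design matrix for each choice of reference level.
cond_by_reference = {
    ref: scaled_condition_number(design_matrix(groups, ref, 3))
    for ref in range(3)
}

for ref, kappa in cond_by_reference.items():
    print(f"reference level {ref} (n={counts[ref]}): condition number = {kappa:.2f}")
```

In this toy setting the rare level produces the largest condition number when it serves as the reference category, because the remaining dummy columns then nearly sum to the intercept column. This is only an illustration under the stated assumptions and says nothing about the magnitudes reported in the article.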
Similar resources
Using latent variables in a logistic regression model to remove the effect of multicollinearity in the analysis of some factors associated with breast cancer
Background and Objectives: Logistic regression is one of the most widely used generalized linear models for analyzing the relationship between one or more explanatory variables and a categorical response. Strong correlations among explanatory variables (multicollinearity) considerably reduce the efficiency of the model. In this study we used latent variables to reduce the effects of ...
Robust Estimation in Linear Regression with Multicollinearity and Sparse Models
One of the factors affecting the statistical analysis of data is the presence of outliers. Methods that are not affected by outliers are called robust methods. Robust regression methods estimate the parameters of a regression model reliably in the presence of outliers. Besides outliers, the linear dependency of regressor variables, which is called multicollinearity...
Forecast generation model of municipal solid waste using multiple linear regression
The objective of this study was to develop a forecast model to determine the rate of generation of municipal solid waste in the municipalities of the Cuenca del Cañón del Sumidero, Chiapas, Mexico. Multiple linear regression was used with social and demographic explanatory variables. The compiled database consisted of 9 variables with 118 specific data per variable, which were analyzed using a ...
Instrumental Variables Regression with Measurement Errors and Multicollinearity in Instruments
In this paper we obtain a consistent estimator when there exist some measurement errors and multicollinearity in the instrumental variables in a two stage least square estimation of parameters. We investigate the asymptotic distribution of the proposed estimator and discuss its properties using some theoretical proofs and a simulation study. A real numerical application is also provided for mor...
Assessment of gully erosion susceptibility using logistic regression in the Salavat Abad watershed, Kurdistan Province
Introduction: Gully erosion is a form of water erosion found in many regions of the world. It can produce an abundant sediment load, reduce the fertility of land, and destroy structures. In the literature, several studies have used logistic regression to prepare landslide susceptibility maps, but few authors have focused on gully erosion suscep...